Quantum Finite State Transducers 



Rusins Freivalds^ and Andreas Winter^ 

^ Institute of Mathematics and Computer Science, University of Latvia, Raina 
bulvaris 29, LV-1459, Riga, Latvia. Email: rusins@paul.cclu.lv. 

^ Department of Computer Science, University of Bristol, Merchant Venturers 
Building, Woodland Road, Bristol BS8 lUB, United Kingdom. Email; 
winterOcs . br is . ac . uk. 



Summary. We introduce quantum finite state transducers (qfst), and study the 
class of relations which they compute. It turns out that they share many features 
with probabilistic finite state transducers, especially regarding undecidability of 
emptiness (at least for low probability of success). However, like their 'little broth- 
ers', the quantum finite automata, the power of qfst is incomparable to that of their 
probabilistic counterpart. This we show by discussing a number of characteristic 
examples. 



1 Introduction and definitions 

The issue of this work is to introduce and to study the computational model of 
quantum finite state transducers. These can be understood as finite automata 
with the addition of an output tape which compute a relation between strings, 
instead of a decision (which we read as a binary valued function). After the 
necessary definitions, the relation to quantum finite automata is clarified 
(section ||), then decidability questions are addressed (section^): it is shown 
that emptiness of the computed relation is undecidable both for quantum and 
probabilistic transducers. However, the membership problem for a specific 
output is decidable. Next, the relation between deterministic and probabilistic 
transducers is explored (section^), and in section ^ quantum and probabilistic 
transducers are compared. 

We feel our extension of quantum automata studies to this new model 
justified by the following quote from D. Scott 

'The author (along with many other people) has come recently to the 
conclusion that the functions computed by the various machines are 
more important - or at least more basic - than the sets accepted by 
these devices. (...) In fact by putting the functions first, the relation- 
ship between various classes of sets becomes much clearer'. 

We start be reviewing the concept of probabilistic finite state transducer. 
For a finite set X we denote by X* the set of all finite strings formed from 
X, the empty string is denoted e. 
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Definition 1 A probabilistic finite state transducer (pfst) is a tuple T = 
(Q, X*!, ^2, /, go, Qacc, Qrej); where Q is a finite set of states, Si, 1^2 is 
the input/output alphabet, cjo d Q is the initial state, and QaccQrej C Q 
are (disjoint) sets of accepting and rejecting states, respectively. (The other 
states, forming set Qnon, o,re called nan halting). The transition function 
V : Si X Q ^ Q is such that for all a £ Si the matrix {Va)qp is stochastic, 
and fa.Q^S2 is the output function. If all matrix entries are either or 
1 the machine is called a deterministic finite state transducer (dfst) . 

The meaning of this definition is that, being in state q, and reading input 

symbol a, the transducer prints fa{q) on the output tape, and changes to state 
p with probability (14)gp, moving input and output head to the right. After 
each such step, if the machine is found in a halting state, the computation 
stops, accepting or rejecting the input, respectively. 

To capture this formally, we introduce the total state of the machine, 
which is an element 

(PNON,PACC,Prej) € £\Q X S^) ® i\S;) ® i\{RE3}), 

with the natural norm 

||(PNON,PACC,Prej)|| = Hi^NONlll + Hi^ACcUl + IPrejl- 

At the beginning, the total state is {{qo, e), 0, 0) (where we identify an clement 
of Q X S2 with its characteristic function). The computation is represented 
by the (linear extensions of the) transformations 

Ta ■■ ((g,w),PACC,Prej) ^ XI i^'^)qpP, wfa{q) j ,PkcC^P'rei 1 . 



of the total state, for a G Si, with 

-Pacc i^) 



(Pacc{x) + EpeQ,,,(^a)9p if a; = wfa{q), 
\-PACc(a;) else, 



and p^^j = prej + T,peQr.i (^"^^f 

For a string xi .. .Xn the map T^ is just the concatenation of the T^^. 
Observe that all the Ta conserve the probability. 

Implicitely, we add initial and end marker symbols (|, $) at the input, 
with additional stochastic matrices Vf and V$, executed only at the very 
beginning, and at the very end. We assume that V$ puts no probability outside 

Qacc U Qrej • 

By virtue of the computation, to each input string v ^ S^ there corre- 
sponds a probability distribution T{-\v) on the set S2 U {REJ}: 



r(REJ|^;) :=rj,s((go,e),O,0)[REJ] 
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is the probability to reject the input ?j, whereas 



T{w\v) :=rj„$((go,e),O,0)[H 



is the probabihty to accept, after having produced the output w. 

Definition 2 LetTZ C ^ ^2- 

For a > 1/2 we say that T computes the relation TZ with probability a if 
for all V, whenever {v,w) £ TZ, then T{w\v) > a, and whenever {v,w) ^ TZ, 
then T(w\v) <l~a 

For < a < 1 we say that T computes the relation TZ with isolated 
cutpoint a if there exists e > such that for all v, whenever (w, w) G TZ, then 
T(w\v) > a + e, but whenever (v, w) ^ TZ, then T{w\v) < a — e. 

The following definition is modelled after the ones for pfst for quantum finite 
state automata 0: 

Definition 3 A quantum finite state transducer (qfst) is a tuple T — {Q, Si, 
^2, /, 90; Qacc, Qroj); where Q is a finite set of states, Si, S2 is the in- 
put/output alphabet, qo € Q is the initial state, and QaccQrcj C Q are 
(disjoint) sets of accepting and rejecting states, respectively. The transition 
function V : Si x Q Q is such that for all a Cz Si the matrix {Va)qp is 
unitary, and fa'-Q^ ^2 is the output function. 

Like before, implicitely matrices and V$ are assumed, V$ carrying no ampli- 
tude from Qnon to outside Qacc U Qrej- The computation proceeds as follows: 
being in state q, and reading a, the machine prints fa{q) on the output tape, 
and moves to the superposition Va\q) ~ J2p{^a)qp\p) of internal states. Then 
a measurement of the orthogonal decomposition -Enon ® ^/acc ® E^ej (with the 
subspaces Ei = span Qi C £'^{Q), which we identify with their respective 
projections) is performed, stopping the computation with accepting the in- 
put on the second outcome (while observing the output), with rejecting it on 
the third. 

Here, too, we define total states: these are elements 



with norm 

||(|'0NON),fACC,Proj)|| = |||'0NOn)||2 + ||-PaCc||i + IProjl- 

At the beginning the total state is (|<;o) <8i |e),0,0), the total state transfor- 
mations, for 



(|V'NON),PACC,Prej) £ ^'(Q X S^) ® £\S;) ® e\{REJ}) 





weS2 
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are (for a £ Si) 



Ta : (|V),PACC,Prcj) ^ i^nonJ^Kl?) K/a (?)) , P^CC . 



where = I]^^ ag«,|w/a(g)), and 



E 



Va\q) 



q,w s.t. x—wfaio) 



Pre'} ~ -Prcj 



^rcj5]K|'?)0K/a(g)) 



Observe that the Ta do not exactly preserve the norm, but that there is a 
constant 7 such that ||Ta(X)|| < 7||X|| for any total state X. Quite straight- 
forwardly, the distributions T{-\v) are defined, and so are the concepts of 
computation with probability a or with isolated cutpoint a. 

Observe also that we defined our model in closest possible analogy to 
quantum finite automata . This is of course to be able to compare qfst to 
the latter. In principle however other definitions are conceivable, e.g. a mixed 
state computation where the Ta are any completely positive, trace preserving, 
linear maps (the same of course applies to quantum finite automata!). We 
defer the study of such a model to another occasion. 

Notice the physical benefits of having the output tape: whereas for finite 
automata a superposition of states means that the amplitudes of the various 
transitions are to be added, this is no longer true for transducers if we face 
a superposition of states with different output tape content. I.e. the entangle- 
ment of the internal state with the output may prohibit certain interferences. 
This will be a crucial feature in some of our later constructions. 



2 Quantum Finite Automata and Quantum 
Transducers 

The definition of qfst is taylored in such a way that by excluding the output 
tape and the output function, we get a quantum finite automaton. One, 
however, with distinct acceptance and rejection properties, as compared to 
the qfst. Nevertheless, the decision capabilities of qfst equal those of quantum 
finite automata: 

Theorem 4 A language L is accepted by a 1-way quantum finite automaton 
with probability bounded away from 1/2 if and only if the relation L x {0} U 
L X {1} is computed with isolated cutpoint. 
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Proof. First observe that for finite automata (probabilistic and quantum), 
recognizability with an isolated cutpoint is equivalent to recognizability with 
probability bounded away from 1/2 (by "shifting the cutpoint": just add in 
the |-step possibilities to accept or reject right away with certain probabili- 
ties). We have to exhibit two constructions: 

Let there be given a quantum finite automaton. We may assume that it 
is such that V$ is a permutation on Q. 

This can be forced by duplicating each q e Qacc U Qroj by a new state 
q', and modifying the transition function as follows: denote by a the map 
interchanging q with q' for q ^ Qnon, and being the identity on q,q' for 
Q € Qnon- Define a unitary U such that for q G Qnon 

u\q) = j2in),p\'^p), 
p 

and U\q) = \q) ior: q e Qacc U Qrej- Now let 

V{:=UVx, Vi:=a, := UVaU-\ 

It is easily checked that this automaton behaves exactly like the initial one. 

Construct a qfst as follows: its states are QUQ, with Q = {q: q€ Qacc U 
Qrej} being the accepting states, and no rejecting states. Let the transition 
function be W with 

Wa\q) = Va\q) for q € Q„on, but 

Wa\q) = \q) for q e Qacc U Qrej- 

Since V$ is the permutation tr on Q, we may define 



W^\q) 



\aq) for aq G Qacc U Qrej, 

I erg') for uq e Qnon- 



Finally, let the output function be (for q G Q) 



jo for g e Qacc, jO for erg e Qacc, 

fail) = \, f ^ r, -^^W = i 1 f ^r, 

[1 forgsQrej, [1 forcrgeQrej, 

and e in all other cases. It can be checked that it behaves in the desired way. 

Given a qfst, construct a quantum finite automaton as follows: its states 
are Q x S^*, where the second component represents the tape content up 
to i = 1 + maxa^g l/a(g)| many symbols. Initial state is (go, e). Observe that 
by definition of the Tq amplitude that once is shifted onto output tapes of 
length larger than 1 is never recovered for smaller lengths. Hence we may as 
well cut such branches by immediate rejection: the states in Q x 17^ are all 
rejecting, and so are (QaccUQrej) x {!}• The accepting states are Qacc x {0}. 
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The transition function is partially defined by 

Wa\q,x) Y.^Va)qp\p,xfa{q)), x£S^{e}, 
peg 

(for a = $ this is followed by mapping |p, e) to a rejecting state, while leaving 
the other halting states alone), i.e. the automaton performs like the qfst on 
the elements of Q, and uses the second component to simulate the output 
tape. We think of Wa being extended in an arbitary way to a unitary map. 
One can check that this construction behaves in the desired way. □ 

3 Decidability questions 

As is well known, the emptiness problem for the language accepted by a de- 
terministic (or nondeterministic) finite automaton is decidable. Since the lan- 
guages accepted by probabilistic and quantum finite automata with bounded 
error are regular (|J^, these problems are decidable, too. 

For finite state transducers the situation is more complicated: In jsj it is 
shown that the emptiness problem for deterministic and nondeterministic fst 
is decidable. In contrast we have 

Theorem 5 The emptiness problem for pfst computing a relation with prob- 
ability 2/3 is undecidable. 

Likewise, the emptiness problem for qfst computing a relation with prob- 
ability 2/3 is undecidable. 

Proof. By reduction to the Post Correspondence Problem: let an instance 
{vi,... ,Vk), {wi,... ,Wk) of PCP be given (i.e. Vi,Wi G It is to be 

decided whether there exists a sequence ii, . . . , i„ (n > 0) such that 

I'll • • • Vi^ = Wi^ - ■■ Wi^ . 

Construct the following qfst with input alphabet {1,... ,fc}: it has states 
qo,qv,qw, and grej. The initial transformation produces a superposition of 
qv,qiu,qrc3, each with amplitude l/VS. The unitaries Ui are all identity, but 
the output function is defined as fi{qx) = Xi, for x G {v,w}. The endmarker 
maps qv,qw to accepting states. It is clear that ii, . . . ,i„ is a solution iff 
(ii . . .in,Vi-^ ■ • • Vi^) is in the relation computed with probability 2/3 (the 
automaton is easily modified so that it rejects when the input was the empty 
word, in this way we force n > 0). 

By replacing the unitaries by stochastic matrices (with entries the squared 
moduli of the corresponding amplitudes) the same applies to pfst. 

Since it is well known that PCP is undecidable, it follows that there 
can be no decision procedure for emptiness of the relation computed by the 
constructed pfst, or qfst, respectively. □ 
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Remark 6 Undecidable questions for quantum finite automata were noted 
first for "1^-way" automata, i.e. ones which move only to the right on their 
input, hut may also keep their position on the tape. In ^ it is shown that 
the equivalence problem for these is undecidable. The same was proved for 
l-way-2-tape quantum finite automata in 

Conjecture 7 The emptiness problem for probabilistic and quantum fst com- 
puting a relation with probability 0.99 is decidable. 

The emptiness problem for probabilistic and quantum fst computing a re- 
lation with a single-letter input alphabet, with probability 1/2 + e is decidable. 

To prove this, we would like to apply a packing argument in the space of 
all total states, equipped with the above metric. However, this fails because 
of the infinite volume of this space (for finite automata it is finite, see 
and 1^]). In any case, a proof must involve the size of the gap between the 
upper and the lower probability point, as the above theorem shows that it 
cannot possibly work with gap 1/3. 
Still, we can prove: 

Theorem 8 // the relation TZ is computed by a pfst or a qfst with an isolated 
outpoint, then Range(7?.) = {y : 3x {x,y) € TZ} is a recursive set (so, for 
each specific output, it is decidable if it is ever produced above the threshold 
probability) . 

Proof: Let the cutpoint be a, with isolation radius S, and let y — yi . . . y„ S 

Si. 

Define Y = {yi ■ . - yi : < i < n}, the set of prefixes of y. Consider the 
output^ -truncated total state, which is an element 

(|V^),PAcc,Prcj) e f{Q X r) ® e\Y)®£\{Rm) 

c f{Q X u;)®i\u;)®f{{REj}). 

It is obtained from PACcProj) - with \^p) = Y.q,w^qM ® k> " by 
defining 

1-0)= aq^\q)(^\w), 
qeQ,weY 

Pacc = Pacc\y, 

Prcj = Prcj + Y + Y P^CC{W). 

Let us denote this transformation by J. Now observe that in the total state 
evolution of the qfst probability once put outside Y never returns, and like- 
wise, amplitude once put outside Q x Y never returns (compare proof of 
theorem p. Formally, this is reflected in the relation 

JTabi\^),PACC,Pici) = JTbJTa{\-lp) , Pacc, Prcj)- 
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Hence, if we want to know if T{y\x) > a-\- 5 for some x, we may concentrate 
on the space of output-j/-trmicated total states, which is finite dimensional, 
and its transformation functions Ta = JTa- 

It is easily seen that there is a constant 7 such that for all truncated total 
states s, t and all w ^ SI 

||T„s-T;,t|| <-f\\s-t\\. 
Hence, for x,x',w £ SI, if 

\\f^M ® |e),0,0) - ft,.(|go) ® k),O,0)|| < 5/7, 

then 

WTf^MM ® |e),O,0) -r^,,„$(|go) |e),0,0)|| < ,5. 

Because of the cutpoint isolation we find that either both or none of {x,y), 
{x',y) is in TZ. Now, because of compactness of the set of truncated total 
states reachable from the starting state, it follows that there is a constant 
c > 1 such that for all x G Si of length > c one can write x — vxqw, with 
< |xo| < c, such that 

\\ft.ccoi\qo) ® |e),0,0) -Tj^dgo) ® 0,0)11 < S/-f. 

Hence 

WTfMko) ® |e),O,0) - fj,„$(|(Zo) ® k),O,0)|| < 6, 

and thus, if x had produced y with probability at least a + S, so had the 
shorter string vw. This means that we only have to consider input strings of 
length up to c to decide whether y G Range(7?.). 
Obviously, this reasoning applies to pfst, too. □ 

4 Deterministic vs. Probabilistic Transducers 

Unlike the situation for finite automata, pfst are strictly more powerful than 
their deterministic counterparts: 

Theorem 9 For arbitrary £ > the relation 

7ei = {(0'"l'",2'"):m>0} 

can be computed by a pfst with probability 1 — £. It cannot be computed by a 
dfst. 

Proof: The idea is essentially from [Q : for a natural number k choose initially 
an alternative J S {0, . . . , k—l}, uniformly. Then do the following: repeatedly 
read k O's, and output j 2's, until the I's start (remember the remainder 
modulo fc), then repeatedly read k I's, and output k — j 2's. Compare the 
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remainder modulo k with what you remembered: if the two are equal, output 
this number of 2's and accept, otherwise reject. 

It is immediate that on input 0™1™ this machine outputs 2™ with cer- 
tainty. However, on input 0™!™ each 2" receives probability at most 1/fc. 

That this cannot be done deterministically is straightforward: assume that 
a dfst has produced /(m) 2's after having read m O's. Because of finiteness 
there are k,l such that after reading k I's (while uq 2's were output) the 
internal state is the same as after reading I further I's (while n 2's are output). 
So, the output for input 0™!'^+''' is 2-^('"')+"'0+''", and these pairs are either all 
accepted or all rejected. Hence they are all rejected, contradicting acceptance 
for m = k + rl. □ 

By observing that the random choice at the beginning can be mimicked 
quantumly, and that all intermediate computations are in fact reversible, we 
immediately get 

Theorem 10 For arbitrary e > the relation TZi can be computed by a qfst 
with probability 1 — e. □ 

Note that this puts qfst in contrast to quantum finite automata: in |^ it was 
shown that if a language is recognized with probability strictly exceeding 7/9 
then it is possible to accept it with probability 1, i.e. reversibly deterministi- 
cally. 

Theorem 11 The relation 

7^2 = {{w2w, w) -.w e {0, 1}*} 
can be computed by a pfst and by a qfst with probability 2/3. 

Proof. We do this only for qfst (the pfst is obtained by replacing the unitaries 
involved by the stochastic matrices obtained by computing the squared mod- 
uli of the entries): let the input be x2y (other forms are rejected). With 
amplitude l/VS each go to one of three 'subprograms': either copy x to the 
output, or y (and accept), or reject without output. This works by the same 
reasoning as the proof of theorem ^ □ 

5 ... vs. Quantum Transducers 

After seeing a few examples one might wonder if everything that can be done 
by a qfst can be done by a pfst. That this is not so is shown as follows: 

Theorem 12 The relation 

7^3 = {(0™l"2^ 3"") : n ^ k A {m = kV m = n)} 
can be computed by a qfst with probability 4/7 — s, for arbitrary e > 0. 
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Theorem 13 The relation TZ^ cannot be computed by a pfst with probability 
bounded away from 1/2. In fact, not even with an isolated cutpoint. 

Proof (of theorem For a natural number I construct the following trans- 
ducer: from go go to one of the states qi, qj_b (j G {0, . . . ,1 — 1}, b E {1, 2}), 



with amplitude \/5/7 for qi and with amplitude \/2/ (71) each, for the oth- 

ers. Then proceed as follows (we assume the form of the input to be 0'"1"2'"', 
others are rejected): for qi output one 3 for each 0, and finally accept. For qj^b 
repeatedly read I O's and output j 3's (remember the remainder m mod I). 
Then repeatedly read / &'s and output l—j 3's (output nothing on the (3— &)'s). 
Compare the remainder with the one remembered, and reject if they are un- 
equal, otherwise output this number of 3's. Reading $ perform the following 
unitary on the subspace spanned by the qj^b and duplicte states qj'^b- 



Accepting are all qj',2, rejecting are all qj'.i- 

Now assume that the input does not occur as the left member in the 
relation: this means either m ^ k and m ^ n, or m = n = k. In the first 
case all the outputs in each of the ^-branches of the program are of different 
length, so get amplitude ^2/(7Z). The final step combines at most two of 
them, so any output is accepted with probability at most 4/(7^). The second 
case is more interesting: in all branches the amplitude is concentrated on the 
output 3™. The rotation V$ however is made such that the amplitude on qji^2 
cancels out, so we end up in a rejecting state qj',i. In total, any output is 
accepted with probability at most 3/7 -I- e. 

On the other hand, if the input occurs as the left member in the relation, 
exactly one of the two 6-branches of the program concentrates all amplitude 
on output 3™, whereas the other spreads it to I different lengths. This means 
that the output 3™ is accepted with probability at least {I ~ 1) • 1/{71), and 
others are accepted with probability at most 1/(7Z) each. In total, the output 
3"* is accepted with probability at least 4/7 — e, all others are accepted with 
probability at most 3/7 -I- e. □ 

Proof (of theorem\ldl): By contradiction. Suppose TZs is computed by a pfst T 
with isolated cutpoint a. The following construction computes it with prob- 
ability bounded away from 1/2: assuming a < 1/2 (the other case is similar), 
let p = ^1^"^" ■ Run one of the following subprograms probabilistically: with 
probability p output one 3 for each 0, and ignore the other symbols (we may 
assume that the input has the form 0'"1"2'^), with probability 1 — p run T 
on the input. It is easily seen that this new pfst computes the same relation 
with probability bounded away from 1/2. 

Hence, we may assume that T computes TZ with probability ip > 1/2, from 
this we shall derive a contradiction. The state set Q together with any of the 
stochastic matrices Vb, Vi , V2 is a Markov chain. We shall use the classification 
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of states for finite Markov chains (see ||]): for Q is partitioned into the 
set Ri of transient states (i.e. the probabiHty to find the process in Ri tends 
to 0) and a number of sets Sij of ergodic states (i.e. once in Sij the process 
does not leave this set, and aU states inside can be reached from each other, 
though maybe only by a number of steps). Each Sij is divided further into its 
cyclic classes Cyv {f G ^di^), Vi mapping C^v into Cyv+i. By considering 
sufficiently high powers Vf (e.g. product of all the periods dij) as transition 
matrices, all these cyclic sets become ergodic, in fact, Vf' restricted to each 
is regular. 

Using only these powers amounts to concentrating on input of the form 
0'"1"2'=, with i = i'^, which we will do from now on. Relabelling, the er- 
godic sets of Vi = Vf will be denoted Sij. Each has its unique equilibrium 
distribution, to which every initial one converges: denote it by nij. Further- 
more, there are limit probabilities a{jo) to find the process Vq in Sojg after 
long time, starting from qq. Likewise, there are limit probabilities fo(ji|jo) to 
find the process Vi in Sij-^ after long time, starting from TToj^ , and similarly 
c(i2|ji)- So, by the law of large numbers, for large enough m, n, k the proba- 
bility that Vq has passed into Sojg after -^/ro steps, after which Vi has passed 
into Sij-^ after steps, after which V2 has passed into 52^2 after \/k steps, 
is arbitrarily close to P(jo,ii, ^2) = a(jo)&(ji|jo)c(j2|ji)- (Note that these 
probabilities sum to one). 

As a consequence of the ergodic theorem (or law of large numbers), 
see ch. 4.2, in each of these events J — {jo,ji,j2) the probable num- 
ber of 3's written after the final $, is linear in to, n, k: 



as m, n, k 



00, with 



Xj{m,n,k) 



ajm + Pjn + jjk, 



and non-negative constants aj,/3j,7j. 
Since we require that for k ^ m 



r(3'''"|0™l"2''') > ip, 



it is necessary that for a set A of events J — {jo,ji,j2) 



aj + /3j = d, 7j = 0, with P{A) > 



In fact, as for J ^ ^ 



for certain sequences m,k 00, we even have 



P(J)T(3'^"|0"1™2^ J)>ip- 0(1). 
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For J e ^ it is obvious that the transducer outputs no more 3's, once 
in S'2j2. But this impUes that for m,k large enough, T(3*"|0'"l"2'=, J) is 
arbitrarily close to r(3'i™|o™l™2™, J), hence 

T(3*"|0"'l™2™) > ^-o(l), 

which implies that 

T(3*"|0'"l™2™) > (f, 

contradicting (o^™l''™2''™, 3''™) ^ 7^3. □ 

In general however, computing with isolated cutpoint is strictly weaker 
than with probability bounded away from 1/2 (observe that for finite au- 
tomata, probabilistic and quantum, recognizability with an isolated cutpoint 
is equivalent to recognizability with probability bounded away from 1/2, see 
theorem ||): 

Theorem 14 The relation 

7^4 = {(O^ra, 4') : (a = 2 ^ ? = m) A (a = 3 ^ ? = n)} 

can be computed by a pfst and by a qfst with an isolated cutpoint ( in fact, one 
arbitrarily close to 1/2 j, but not with a probability bounded away from 1/2. 

Proof. First the construction (again, only for qfst): initially branch into two 
possibilities co,ci, each with amplitude 1/^/2. Assume that the input is of 
the correct form (otherwise reject), and in state Ci output one 4 for each i, 
ignoring the (1 — i)'s. Then, if a = 2 + i, accept, if a = 3 — i, reject. It is 
easily seen that 4' is accepted with probability 1/2 if (0™l"a, 4') £ TZa, and 
with probability otherwise. 

That this cannot be done with probability above 1/2 is clear intuitively: 
the machine has to produce some output (because of memory limitations), 
but whether to output 4™ or 4" it cannot decide until seeing the last symbol. 
Formally, assume that |m — n| > 4t, with t = max^ ^ |/a('?)|- If 

rto™i"2$((go,e),0,0)[4™] =r(4™|0"l"2) > 1/2 + ,5, 

necessarily 

Tjo-n-^ ((go, e), 0, 0)[4"] + r^o-i" Hqo, e), 0, 0)[g„o„ x 4[™-2t,m+2*]] > + 6. 
But this implies 

Tto™i" ((90, e), 0, 0)[4"] + Tfo^v^Hqo, e), 0, 0)[g„o„ x 4n-2t,n+2t]^ < _ 
hence 

Tto"-i"3$((9o, e), 0, 0)[4"] = T(4"|0"l"3) < 1/2 - S, 

contradicting (0"'1"3,4") e 7^4. □ 

To conclude from these examples, however, that quantum is even better 
than probabilistic, would be premature: 
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Theorem 15 The relation 

7^5 = {{wx, x) -.we {0, 1}*, X e {0, 1}} 

cannot be computed by a qfst with an isolated cutpoint. ( Obviously it is com- 
puted by a pfst with probability 1, i.e. a dfst). 

Proof. The construction of a dfst computing the relation is straightforward. 
To show that no qfst doing this job exists, we recall from that {0, 1}*0 
is not recognized by a 1-way quantum finite automaton with probability 
bounded away from 1/2, and use theorem |^ for this language. □ 

6 Conclusion 

We introduced quantum finite state transducers, and showed some of their 
unique properties: undecidability of the emptiness problem, as opposed to de- 
terministic finite state transducers and finite automata, and incomparability 
of their power to that of probabilistic and deterministic finite state transduc- 
ers. As open questions we would like to point out primarily our conjecture 0. 
Another interesting question is whether a relation computed by a qfst with 
probability sufficiently close to 1 can be computed by a pfst. This would be 
the closest possible analog to the "7/9-theorem" from l^. 
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